Strong Robustness of Randomized 
Rumor Spreading Protocols* 

Benjamin Doerr^ Anna Huber 
Max-Planck-Institut fiir Informatik 
Saarbriicken, Germany 

Ariel Levavi* 
University of California, San Diego 

March 11, 2010 



Abstract 

Randomized rumor spreading is a classical protocol to disseminate 
information across a network. At SODA 2008, a quasirandom version 
of this protocol was proposed and competitive bounds for its run- 
time were proven. This prompts the question: to what extent does 
the quasirandom protocol inherit the second principal advantage of 
randomized rumor spreading, namely robustness against transmission 
failures? 

In this paper, we present a result precise up to (1 ± o(l)) factors. 
We limit ourselves to the network in which every two vertices are con- 
nected by a direct link. Run-times accurate to their leading constants 
are unknown for all other non-trivial networks. 
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We show that if each transmission reaches its destination with a 
probability of p G (0, 1], after (1 + e) ( log^i+p) log 2 n + | lnn^ rounds 
the quasirandom protocol has informed all n nodes in the network with 
probability at least 1 — n _p£//4 °. Note that this is faster than the intu- 
itively natural 1 jp factor increase over the run-time of approximately 
log 2 n + In n for the non-corrupted case. 

We also provide a corresponding lower bound for the classical 
model. This demonstrates that the quasirandom model is at least as 
robust as the fully random model despite the greatly reduced degree 
of independent randomness. 

1 Introduction 

Disseminating information in a network, that is, making information that is 
known to a single node available to all other nodes, is a classical problem. 
A simple, yet powerful approach is randomized rumor spreading, also known 
as random phone calls. In this setting, each node that is already informed 
participates in the dissemination process by randomly calling neighbors and 
passing along copies of the information. Besides being self-organized, this 
approach has two crucial advantages, (i) It is fast. For many important 
network topologies, O(logn) rounds suffice to inform all n nodes with high 
probability, (ii) It is robust against transmission failure. Often, a constant 
fraction of independently chosen transmission failures does not cause serious 
problems, but merely increases the time required by a constant factor. 

Success of the basic randomized rumor spreading protocol has motivated 
the study of several variants. At SODA 2008, a quasirandom version of 
the protocol was proposed. This version is structurally simpler, uses less 
randomness, and is especially beneficial in that each vertex contacts each 
neighbor at most once. Nonetheless, most run-time guarantees known for 
the classical model still hold for the quasirandom version, some in an even 
stronger form. However, little was previously known about the robustness of 
this model. 

In this paper, we offer a detailed investigation of the robustness of the 
quasirandom protocol. We use the following model of lossy communication, 
which was analyzed in |HKP+05j . We assume that each message reaches its 
target with a certain probability p G (0, 1) independently for all transmis- 
sions. For networks in which every two nodes are connected by a link, we 
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demonstrate that this lossiness only increases the run-time by a small con- 
stant factor, which we precisely determine for each value of p. Surprisingly, 
it is smaller than I /p. For example, for p = 1/2, the run-time increases by a 
factor of 1.828 only. The same result also holds for the classical, fully random 
model. 

In addition, we show that the corresponding slow-down for the classical 
model is at least this factor. From this we conclude that the quasirandom 
model is at least as robust as the classical model. 

This is the first time, for the classical as well as for the quasirandom 
model, that results precise up to the leading constant are shown for robust- 
ness. 

1.1 Randomized Rumor Spreading 

The classical, fully random randomized rumor spreading protocol was first 
investigated by Frieze and Grimmett |FG85j . They proposed the following 
model. Let G = (V, E) be an undirected graph. At the start of the protocol, 
a single vertex s E V knows a piece of information that is to be disseminated 
to all other vertices. We say that s is informed. The protocol proceeds in 
rounds (hence it assumes a common clock). In each round, every informed 
vertex v chooses a neighbor u v G N(v) := {u G V \ {u, v} G E} uniformly at 
random and sends a copy of the information to it. This results in u v becoming 
informed, if it is not already, and in u v participating in the dissemination 
process in subsequent rounds. This process defines a random variable T s , 
which denotes the number of rounds after which all vertices in the network 
are informed, assuming that the initially informed vertex is s. The broadcast 
time T is then defined as the maximum of all T s , s G V, where the T s are 
defined over independent probability spaces. 

Frieze and Grimmett demonstrate that if the network is a complete graph 
on n vertices, the broadcast time satisfies T = (1 ± o(l))(log 2 n + Inn) with 
probability 1 — o(l). For hypercubes and random graphs G(n,p), where p > 
(1+e) ln(n)/n, Feige, Peleg, Raghavan and Upfal [FPRU90J also determine a 
broadcast time of G(logn) with probability 1—1 /n, albeit without making the 
implicit constant precise. They also provide the general bounds of 12nlogn 
and 0(A(G)(diam(G) + logn)) for arbitrary n-vertex graphs. Subsequent 
work by Sauerwald |Sau07] . Elsasser and Sauerwald |ES07] and Berenbrink, 
Elsasser and Friedetzky |BEF08j shows that the G(logn) bound also holds 
for expander graphs, Cayley graphs and random regular graphs. 
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We shall not extensively discuss the practical side of the randomized 
rumor spreading protocol. We refer the interested reader to the aforemen- 
tioned paper [FPRU90J as well as the paper by Karp, Shenker, Schindelhauer 
and Vocking [KSSVOOJ for a general discussion, or the works of Demers et 
al. |DGH + 88j and Kempe, Dobra and Gehrke [KDG03| for particular ap- 
plications. What are generally recognized as the three key advantages of 
randomized rumor spreading are speed (logarithmic broadcast time on im- 
portant network topologies); self-organization (there is no central authority 
involved); and robustness against transmission failure. Contrary to broad- 
cast times, significantly less work has been done to quantify the robustness 
of the randomized rumor spreading protocol. 

As far as we are aware, the only results on the robustness of randomized 
rumor spreading are due to Elsasser and Sauerwald [ES09j. They consider 
the model where each transmission does not reach its destination with failure 
probability 1 — p, and failures in different rounds are independent of each 
other. They assert that the broadcast time for all graphs in this lossy model 
is at most a factor of 0(l/p) larger than in the model without transmission 
failures. 

1.2 Quasirandom Rumor Spreading 

The above results show that randomized rumor spreading is a very powerful 
approach to dissemination problems. However, taking all decisions indepen- 
dently at random also has some unwanted effects. For example, a vertex may 
contact one of its neighbors twice before contacting all of its other neighbors. 
This may only be a minor problem for dense graphs like the complete graph, 
but for sparse graphs, it may increase the broadcast time significantly. 

Let G be a star on n vertices, i.e., a graph with one central vertex such 
that all other vertices have this vertex as their only neighbor. Clearly, any 
dissemination process where each vertex can send out at most one transmis- 
sion per round needs at least n — 1 rounds, simply because the central vertex 
has n — 1 neighbors that cannot be informed by other vertices. However, due 
to the coupon collector effect, randomized rumor spreading on the star needs 
0(nlogn) rounds. 

This type of imbalance in informing one's neighbors may be avoided by 
choosing the destination of the current transmission uniformly at random 
from those neighbors which have not yet been contacted by the originat- 
ing vertex. However, this requires tracking all previously sent messages and 
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is therefore less desirable. Motivated by the paradigm of quasirandomness, 
Friedrich, Sauerwald and the first author |DFS08] suggest the following quasi- 
random rumor spreading protocol. 

In this model, each vertex is equipped with a cyclic permutation of its 
neighbors. As before, the protocol proceeds in rounds, and all informed ver- 
tices participate in the dissemination process. However, each vertex only 
directs its first transmission to a random neighbor. Subsequently, it informs 
the successors of the first addressee on its list. We shall not make any as- 
sumptions about the structures of these cyclic lists. 

Before analyzing the quasirandom protocol, let us discuss it from an im- 
plementation point of view. From a theory perspective, we immediately note 
that the quasirandom model requires each vertex to store the permutation 
of its neighbors, which may utilize up to O(nlogn) bits. This is not nec- 
essary for the fully random model. However, we may assume that in most 
networks each vertex already has some list or array of its neighbors, since 
the information regarding how to contact a neighbor must be stored some- 
where. In this scenario, the use of the lists does not increase the complexity. 
Rather, it appears that the quasirandom protocol needs less resources. In 
particular, it requires significantly fewer random bits. This is beneficial if we 
consider randomness costly, and useful if we want to trace an actual run of 
the protocol. 

The core question to be answered is whether the quasirandom protocol 
works well even if we are not permitted to design the lists. Surprisingly, the 
answer is yes. 

For all lists that can be present at each vertex, O(logn) rounds suffice with 
high probability to inform all the vertices of a complete graph K n , a hyper- 
cube Q n , an expander graph on n vertices (some extra conditions are needed 
here), or a random graph G(n,p) with p > (1 + e)(\nn)/n [DFS08, DFS09J. 
Naturally, the lower bound of log 2 n rounds valid for the fully random model 
also holds for the quasirandom model. Once again these bounds fall within 
the right order of magnitude. 

Sharper bounds analogous to those defined by Frieze and Grimmett 
are known for the complete graph. In [ADHP09J, it is shown that with 
probability 1 — o(l), the number of rounds needed to inform all vertices is 
(1 ± o(l))(log 2 n + Inn). 

In some settings, we observe better broadcast times than in the classical 
model. One example is the random graph with edge probability p = (In n + 
uj{l))/n only minimally above the connectivity threshold. Nevertheless, with 



5 



probability 1 — o(l), the random graph is such that with high probability the 
quasirandom protocol needs only O(logn) rounds independent of the starting 
point. This is a notable advantage over the fully random model. Feige et 
al. [FPRU90] demonstrate that for p = (Inn + O (log log n))/n, the random 
graph with probability 1 — o(l) is such that 0(log 2 n) rounds are necessary 
to spread the rumor with high probability. 

The bounds obtained for arbitrary graphs are also superior for the quasi- 
random model. For the fully random model, we saw above that 12n In n and 
0(A(G)(diam(G) + logn)) rounds suffice to inform all vertices of an n-vertex 
graph G with high probability |FPRU90] . For the quasirandom model, it is 
easily proven that after 2n — 3 or A(G) diam(G) rounds, all vertices are 
informed with probability one. 

1.3 Robustness of the Quasirandom Protocol 

The above results show that the broadcast time of the quasirandom rumor 
spreading protocol is quite well understood. Together with the experimental 
investigation [DFKS09J, all results indicate that the quasirandom protocol 
achieves comparable or better broadcast times than the random model. For 
the equally important aspect of robustness, much less is known. Since it 
would typically seem that robustness of randomized algorithms is caused by 
the large number of independent random decisions taken by the algorithm, 
one may conclude that the quasirandom protocol is less robust. 

The experimental evaluation in [DFKS09] debunks this assertion. For 
both the hypercube and the complete graph on 2 12 vertices, it was observed 
that if messages sent across the network using either protocol get lost with 
probability |, the broadcast time increases by a factor of between 1.8 and 
1.9. 

The only theoretical result pertaining to robustness is the one in [DFS09J. 
Let G be a graph, T G N and 7 > 1 such that the quasirandom protocol 
independent of the starting vertex with probability 1— n~ 7 succeeds in inform- 
ing all other nodes within T rounds. Then in the presence of transmission 
failures (independently chosen with probability 1 — p), independent of the 
starting vertex, 4j(l/p)T rounds of a modified quasirandom protocol suffice 
to inform all vertices with probability 1 — IrT 1 . The modification of the 
protocol needed to prove this result is that the recipient of a message returns 
a feedback message to the sender (which also gets lost with probability p). 
Whenever the sender does not receive a feedback message, he tries to reach 
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the same addressee in the next round. With this modification, however, the 
result is slightly weaker, in particular, because the feedback modification 
makes the protocol significantly less simple. 

In addition, the robustness result in [DFS09J leaves room for constant 
factor differences between the random and the quasirandom models in the 
presence of transmission faults. 

To gain a deeper understanding, we study the robustness of quasirandom 
rumor spreading (without the feedback modification) on the complete graph 
in this work. Recall that the complete graph is the only graph for which a 
broadcast time of one of the two models precise up to the leading constant 
is published. We show the following main result. 

Main Result: For all e > and p G (0, 1] the quasirandom ru- 
mor spreading protocol with arbitrary lists, despite independent message 
losses occurring with probability 1 — p, succeeds with probability at least 
l- n -'P £ / m in informing all other vertices from a given vertex in time at most 
(l + e)(log 1+p n+±lnn). 

This result is interesting for two reasons. Firstly, it shows that the quasi- 
random protocol is even more robust than previous results indicate. Note 
that the above bound is strictly better than (l/p)(l + o(l))(log 2 n + Inn), 
that is, (1/p) times the bound for the case without faulty transmissions. 

Secondly, our results imply that the quasirandom protocol is at least as 
robust as the classical one. To prove this, we show a corresponding lower 
bound for the fully random protocol. 

We should add that our proof for the upper bound of the quasirandom 
model can be modified to yield a corresponding proof for the classical proto- 
col. This is the first bound to make the robustness of the classical protocol 
precise up to the leading constant. 

1.4 Organization of the Paper 

In Section [2J we collect some well-known probabilistic tools we require for the 
remainder of the paper. We proceed in Section 3 by analyzing the randomized 
rumor spreading model, as this is shorter and easier and may serve as an 
introduction to the techniques we use. We provide a lower bound on the 
broadcast time. We then analyze the quasirandom version of this model in 
Section 4. This is the main part of our proof. Our goal is to show that the 
rumor is disseminated in the quasirandom model at least as quickly as it is 
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in the random model, and so in this section we will focus on determining an 
upper bound on the broadcast time. 



2 Probabilistic Tools 

A basic tool that we will use in the following proofs is the Chernoff 
bound |Che52j . This provides exponentially small bounds for the proba- 
bility that a binomially distributed random variable deviates significantly 
from its expected value. A similar version of the classical result can be found 
in |MU05j . We will use it in the following form. 

Lemma 1 (Chernoff Bounds). Let X\, . . . ,X n be independent random vari- 
ables, taking values in [0, 1]. Let X := Y^i=i an< ^ ^ ^ e (0> -*-]■ Then 

Pr (X < (1 - 5) EpO) < e~ s2 E(x)/2 , 

and 



Pr(X > (l + 5)E(X)) < e 



-S 2 E(X)/3 



There are places where we would like to use Chernoff bounds, but we do 
not have independence of the random variables. Here results of Panconesi 
and Srinivasan |PS97l ISriOl] show that we may use the classical Chernoff 
bounds even under the more general assumption that the random variables 
are negatively correlated. This is defined as follows. 

Definition 2. The random variables X%, . . . , X n , taking values in {0, 1} ; are 
called negatively correlated, if for every subset I C {1, . . . , n} we have 



and 



Pr (/\X 4 = l) < JJPr(X i = l), 

\iei J iei 

Pr(/\JQ = 0) < J]Pr(X 4 = 0). 

Vie/ / iei 



The results of Panconesi and Srinivasan |PS97j, ISriOl] yield the following 
lemma. 
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Lemma 3 (Chernoff Bounds for negatively correlated random variables). 

Let X\, . . . , X n be negatively correlated random variables, taking values in 
{0, 1}. Let X := Y%=i x i and let S e (0. !]■ Then 



Pr(X < (1-6)E(X)) < e 



5 2 E(X)/2 



and 



Pr (X > (1 + 5) E(X)) < e - s2E{x)/ \ 



Later in this paper we will apply the inequality by Azuma |Azu67j . Intu- 
itively, it provides strong bounds on the probability that a function defined 
on a set of independent random variables deviates significantly from its ex- 
pectation, when the value of the function is affected only slightly by changes 
to only one of its arguments. We will use it in the following version, stated 
in |McD89l Lemma 1.2]. 

Lemma 4 ( Azuma- Inequality) . Let X±, . . . , X n be independent random vari- 
ables, with Xi taking values in a set f2j for each i. Suppose that the (mea- 
surable) function f : Yl^=i — ^ satisfies 



whenever the vectors x and x' differ only in the ith coordinate. Let Y be the 
random variable f(Xi, . . . , X n ). Then for any t > 0, 



3 Lower Bound for Randomized Rumor 
Spreading 

In this section, we analyze the classical (fully random) rumor spreading model 
in which each informed node randomly chooses a neighbor to inform at the 
beginning of each round, but only makes successful contact with probabil- 
ity p G (0, 1]. We prove the following lower bound for the broadcast time. 

Theorem 5. Let e > and p G (0,1]. With probability 1 — e ~ n ( n£/b ) ; the 
number of rounds we need to inform all the nodes of the complete graph on 



\f(x)-f(x')\< Ci 
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n vertices using the random rumor spreading protocol with message success 
probability p is at least 



The key to this proof is to split up the rumor spreading process into three 
phases. The first phase is composed of the rounds that occur between the 
start of the process and the end of the first round at which point n e l 2 nodes 
are informed. The second phase begins directly after Phase 1 terminates, 
and continues until the end of the first round after which n/4 nodes are 
informed. Within each round of Phase 2, the number of informed nodes will 
grow by a multiplicative factor. The last phase begins directly after Phase 2 
terminates, and continues until all the nodes are informed. In this phase we 
observe a type of coupon collector process. 

In order to establish the lower bound posited above, we give lower bounds 
for the durations of Phases 2 and 3. By N t we will denote the set of vertices 
that are newly informed at a given time-step t. Similarly, by I t we will denote 
the set of vertices informed after time t. 

Lemma 6. Let e > 0. With probability 1 — e ~ n ^ n£/6 \ we need more than 
(1 — e) log 1+p n rounds to complete Phase 2. 

Proof. Let t\ denote the number of rounds needed to inform the first n £ / 2 
nodes. Note that this means that n e l 2 < |J tl | < 2n £ / 2 . Let t > t±. We have 
E(|iVt + i|) < p\h\- Enumerate the nodes in I t from 1 to \It\, and define the 
indicator random variables X±, . . . X\i t \ such that 



In this context a successful contact refers only to the transmission of 
the rumor, regardless of whether or not the contacted vertex was already 
informed or is also contacted by another vertex. Therefore the random vari- 
ables X±, . . . , X\i t \ are independent. 

If X := J2i Xi, then E(X) = p\h\- It is intuitive that |iV t+ i| < X, because 
X not only counts all the nodes in N t+ i, but also counts nodes multiple times 
if they are contacted by multiple nodes, and counts nodes that are contacted 
in round t+1 that have already been informed in previous rounds. Therefore, 





1 if vertex % has successfully contacted another vertex 
otherwise. 
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any upper bound we can find on the size of X also holds as an upper bound 
for the size of N t+1 . But because of the independence of Xi, . . . ,X\j t \, the 
random variable X is a lot easier to handle. 
Using Chernoff bounds, we see that 

Pr (X > (1 + n- £/& ) E(X)) < exp (-|n" e/3 p|/ t |) 

< exp (-|n" e/3 pn e/2 ) 

= exp (— |pn £ / 6 ) 
= e -n(n«/«)_ 

So with probability 1 — e~ n ( ne/6 \ the number of nodes informed after t + 1 
rounds satisfies 

|/ml < \It\ + (l + n~ e / 6 )p\I t \ < (l +n - £ ^)(l+p)\I t \. 

We can therefore infer by using recursion that for every fceNwe have 

\It +k \ < (l + n~ £ ' e ) k (l+p) k \I t \ 

with probability 1 — ke" n ^ nS/6 \ Pick k — (1 — e) log 1+p n. Under the assump- 
tion that n is sufficiently large, we compute 

lW- e )iog 1+J> n| < (l + n-^-^+^il+p)^^^ 2 
= (1 +^ e/6 ) (1 ~ e)logl +f n n 1 - £ 2n £/2 

< exp (n- £/6 (l - e) log 1+p n) 2n 1 ~ e/2 

< In 1 -*' 2 

< n/4 

with probability 1 — e - n (" e/e ). So (1 — e) log 1+p n rounds are, with probability 
1 — e -^(™ £/6 ) ) not enough to complete Phase 2. □ 

Lemma 7. Let £ > 0. Mi/j probability 1 — e - n (™ £ ) ; y;e need more than 
(1 — e)- Inn rounds to complete Phase 3. 
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Proof. Let t 2 denote the number of rounds needed to inform the first n/4 
nodes. This means that we have n/4 < \I t2 \ < n/2. Let v be a node that is 
still uninformed by time t 2 . Then 



Pr(node v is uninformed at time t 2 + k) > ( 1 



» fc(n-l) 
p \ 



n — 1 



Enumerate the uninformed nodes at time t 2 from 1 to |V\It 2 |. Define the 
indicator random variables X%, . . . , X\ V \i t2 \ such that for i e {1, . . . , |V\/t 2 |}, 
we have 

{1 if node i is uninformed at time t 2 + (1 — e)-hm 
v 
otherwise. 

Let X := ^l=i 7 ' 2 ' This is the number of uninformed vertices at time 
t 2 + (1 — e)Mnn. Since \V \ h 2 \ = n — \It 2 \ > ti/2 and because for small 
enough x > we have 1 — x > e~ x ~ x2 , we obtain for the expected value 

\v\h 2 \ 

E(X) = Pr (^ = !) 



n / p 
> - 1 



(n-l)(l-e)ilnn 



2 V n - 1 

> — exp I —(1 — e) \nn (1 — e) Inn 

2 \ n — 1 

> l ne -(l-e)lnn 

= ±n e 

again assuming that n is sufficiently large. 

The random variables X 1 , . . . , X\ V \i t2 \ are negatively correlated. Hence 
by Lemma [3] we get 

Pr(X = 0) < Pr (X < (1 - |) E(X)) < e ~ E(x)/8 < e~ n£/32 . 

So with probability at least 1 — e~ n£//32 , we see that (1 — e)^ Inn rounds 
are insufficient to complete Phase 3. □ 

Proof of Theorem 1. Follows immediately from Lemmas 2 and 3. □ 
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4 Upper Bound for Quasirandom Rumor 
Spreading 

In this section we analyze the quasirandom counterpart of the rumor spread- 
ing model described in the previous section. This model differs from the 
random model in that each vertex is equipped with a cyclic list of its neigh- 
bors and only chooses the first neighbor it attempts to contact at random. 
After its initial choice, each vertex subsequently attempts to contact the 
remaining vertices in the order of its list. Each of these attempts is indepen- 
dently successful with probability p. Note that we do not assume that the 
sender is notified of a transmission failure. 

Our goal is to prove that the rumor in the quasirandom model spreads at 
least as quickly as in the random model. To this aim, we prove the following. 

Theorem 8. For every e > and p £ (0,1], the number of rounds we 
need to inform all the nodes of the complete graph on n vertices using the 
quasirandom rumor spreading model with message success probability p is at 
most 



Unfortunately, since the rumor spreading process is saturated with many 
dependencies, determining the run-time for the the quasirandom model is not 
straightforward. As in [DFS08J, we try to overcome this difficulty by suitably 
simplifying the random experiment, in particular, by assuming that certain 
vertices stop informing (ignoring), and that other vertices do not immedi- 
ately start their own informing process after becoming informed (delaying) . 
Delaying turns out to be useful as it gives us some influence on when a vertex 
uses its one random choice. Nodes that have been informed but have not yet 
begun informing new nodes play an important role in our analysis. We call 
them newly informed vertices. 

To obtain bounds that are precise up to the leading constant, however, 
we have to be careful that our delaying and ignoring techniques do not slow 
down the rumor spreading process too much. For this reason, we partition 
the set of rounds that are necessary to inform all the nodes in the graph into 
two different types of phases. For both types of phases, the set of nodes that 
are initially active is the set of newly informed nodes. 




with probability at least 1 — n 



pe/40 
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Lazy phases were also used in the time analysis of |DFS08j . Only nodes 
that are considered active at the beginning of the phase are considered active 
for the remainder of the phase. Nodes that are contacted during the phase, 
although they are still considered to be informed, remain inactive, and are 
therefore unable to spread the rumor themselves for the continuation of the 
phase. 

Since lazy phases neglect the rumor spreading potential of a significant 
portion of the nodes, we also need busy phases. Here, all nodes informed 
during the busy phase are active for the remainder of the phase. In other 
words, nodes newly informed during the busy phase have the ability to spread 
the rumor in each subsequent round until the termination of the phase. By 
choosing the lengths of the busy phases suitably, we balance the difficul- 
ties with the inherent dependencies and the losses due to ignoring informed 
vertices at the end of each phase. 

As a result of implementing phases in which vertices that can spread the 
rumor in the original model are now inactive, we are only delaying the point 
in time at which all the vertices are informed. Therefore, the upper bound 
for the quasirandom model with lazy and busy phases holds as an upper 
bound for the original quasirandom model. 

We will split the rumor spreading process into lazy and busy phases in 
the following way. We start with two lazy phases of ~£ Inn rounds each. The 
main purpose of these two phases, which are easy to analyze, is to inform a 
set of vertices that is sufficiently large enough to maximize the effectiveness 
of the subsequent busy phases. We then perform a logarithmic number of 
busy phases, each composed of a constant number of rounds. This process 
results in a constant fraction of informed nodes, and we only need two more 
lazy phases to render the entire network informed. 

Let I t denote the set of vertices that are informed at a given time-step t. 
Similarly, we will denote the set of newly informed vertices at time t by N t . 

4.1 The First Lazy Phase 

The first lazy phase lasts for |e Inn rounds. Our goal is to prove the following. 

Lemma 9. Let e > 0. After one lazy phase of length |elnn, at least |pelnn 
nodes are newly informed with probability at least 1 — n~ pe ^ m . 

Proof. Let t\ := ^e\nn. At time t — one node, Vq, is informed. We perform 
a lazy phase of length t\. This means that vq contacts each of the first t\ 
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nodes from its list with probability p. Therefore, 



E(|JV tl |) = \pe\nn. 
Using Chernoff bounds we see that 

Pr(|i\y < \pe\nn) =Pr(|JV tl | < (l - \) E(|iV tl |)) 
<exp(-E(|JV tl |)/18) 
= exp (— pe lnn/36) 
= n" p£/36 . 



□ 



4.2 The Second Lazy Phase 

The second lazy phase begins at time t\ + 1 and terminates after ^6 Inn 
rounds. Our goal is to prove the following. 

Lemma 10. Let e > 0. If, at some point t\ in our model, we have ^pelnn < 
I -Wtil < \s\nn and \I tl \ < |elnn + 1, then after one lazy phase of length 
^elnn, at least (|pelnn) 2 nodes are newly informed with probability at least 
1 — n~ 7 for any 7 G [0, 1). 

Proof. Let t\ G N be such that |pelnn < |JV tl | < \e\a.n and |/ tl | < |elnn + 
1 and let t 2 :— t\ + |elnn. Enumerate the nodes of N tl from 1 to \N tl \, and 
impose an artificial ordering on the set so that each node i calls \e In n of 
its neighbors determined from its cyclic list and its initial random decision 
before node i + 1 attempts any contact. For each i G {1, . . . , |iV tl |}, let Xi be 
the indicator random variable of the following event: Vertex % informs none 
of the vertices which were already informed by a vertex from 1 to % — 1, nor 
a vertex from I tl . In other words, if Xj = 1 for all % G {1, . . . , |iV tl |}, then 
|AT t2 | is equal to the number of contacts made during this phase. 

When vertex i first attempts contact, at most \I tl \ {i}\ + — l)e Inn < 
\%e In n other vertices are already informed. Therefore, 

(\e\nn) (he Inn) (^elnn) 3 , 

Pr (Xi = < ^ '-^ '- < ^ '—. 1 

n — 1 n — 1 
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Using a simple union bound, we conclude that 



.,\N tl \}:Xi = 0) 
0) 



Now that we have shown that the chances of contacting an already in- 
formed vertex in this phase are sufficiently small, all that is left to do is to 
determine how many contacts are made during the phase. 

Every node in N tl attempts to contact \e In n nodes, so there are 
\e\an\N tl \ possible contacts made during the phase. Each of these is in- 
dependently successful with probability p. Let Y be the random variable 
denoting the number of contacts that are actually made during the phase. 
Then we have 

E(Y) = \pe\nn\N tl \ > (\ps\nn) Qpelnn) . (2) 
Using Chernoff bounds, we see that 

Pr (y < (|pelnn) 2 ) < Pr (Y < (l - §) E(Y)) 

< e -E(y)/is 

< e -(pelnn) 2 /108 
_ „-pV(lnn)/108 



Pr(V^ G {1, . . . , \N tl |} : X t = 1) = 1 - Pr(3^ G {1, . . 

\N H \ 

> i - Pr (^ = 
i=i 

n- 1 



Therefore, at least (|pelnn) vertices are informed during this phase 
with probability at least 1 - Illi^L _ n V£ 2 (inn)/ios > 1 _ n - 7 f or any g xec i 
7e[0,l). " □ 



4.3 The Busy Phases 

A sufficient number of nodes are informed of the rumor in the first lazy 
phase, and so we are ready to commence the set of busy phases. As we have 
mentioned earlier, the idea of these phases is that nodes informed during 
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each busy phase are able to spread the rumor during subsequent rounds of 
this phase. Because of the dependencies, these phases require a more refined 
analysis. Our goal is to inform a constant fraction of the nodes in the network 
by the time we complete this sequence of phases. 

4.3.1 The Analysis of a Single Busy Phase. 

In order to determine the cumulative effect of the busy phases, we must first 
analyze the impact of a single busy phase composed of k rounds starting 
after time-step t. The theorem we present below is the heart of the precise 
analysis of the quasirandom model. The idea of the proof is to investigate 
the part of the process originating from each single node in N t . A single such 
process can be analyzed with moderate difficulty Unfortunately, there may 
be "conflicts" among these partial processes, that is, several of these partial 
processes may inform the same node, possibly at different times. However, 
we show that only few of these conflicts occur. By completely ignoring all 

parts that are contained in a conflict, we manage to analyze the busy phase. 

2 fe-i 

Let e' > 0, k G N, p G (0,1] and C' < ^{2e)'^^^~ k ~ 1 . We will 
prove the following statement. 

Theorem 11. Let ieN such that in our model at point t we have \N t \ > 
(pe'lnn) 2 and \I t \ < (,'n. For any c > 0, if we perform a busy phase of length 
k, then at the conclusion of this busy phase, the number of newly informed 
vertices satisfies with probability at least 1 — n~ c the inequality 

\N t+k \>p(l+p) k ~ 2 \N t \. 

Proof. Let ( := 2 k ('. Let t be such that we have \N t \ > (joe' Inn) 2 and 
|it| < C'n. Note that 

\h+k\ < (n. 

Enumerate the nodes of N t from 1 to |iV t |. For each % G {1, . . . , |-/V t |}, we 
define the set of descendants of i, denoted Di, such that for any node v G V, 

1. If v is contacted within rounds t + 1, . . . , t + k by i, then v G Di, and 

2. If v is contacted within rounds t + 1, . . . , t + k by u and u G Di, then 
v G Di. 

We say that set Di is conflict-free if the following two conditions hold for 
each node v G Di. 
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1. There exists exactly one node u G A U {i} that attempts to contact v 
within rounds t + 1, . . . , t + k. 

2. No node w G I t +k \ (A U {0) contacts v before round t + k + 1. Note 
that this implies that the node u specified in condition 1 informs v of 
the rumor if it makes contact. 

Otherwise, we call A conflicting. 

We will now determine the likelihood that for all % G {1, . . . , |jV t |}, the 
set A is conflict-free. 

The first condition of conflict-freeness demands that for each node v G A 
there exists exactly one node u G A U {i} that attempts to contact v within 
rounds t + 1, . . . , t + k. To bound the probability that this condition fails, 
we impose an ordering on the random decisions of the vertices in Di U {i}. 
For every such decision d, the probability that d creates a conflict with any 
previous decision, i.e., that the node in question attempts to contact a node 
that is targeted by a different node in Di U {i}, is bounded from above by 
^-2 fc . So the probability that among all decisions a conflict is created is 
bounded from above by ^-(2 fc ) 2 . 

The probability that the second condition of conflict-freeness fails is 

Pr (Di n (£>! U • • • U A-i U A+i U • • • U D m U I t ) ^ 0) . 

Let d be a random decision of a vertex in Di U {i}. 

For all outcomes of random decisions up to round t + k other than those 
of vertices in A U {?}, the probability that d creates a conflict with any such 
decision can be uniformly bounded from above by (£n — 2 fc ) So 

Pr (A n (A u • • • u A-i u A+i u • • • u D m u i t ) ^ 0) 

<|D i U{i}|.(Cn-2*)^ I 
<2 fc (Cn-2 fc )^. 

Using a union bound, the probability that Di is conflicting is bounded 
from above by ^(2 fc ) 2 + 2 k ((n - 2 k ) ^ = 2 fc (n^ < 2 k+1 (k. 

Now we want to bound the number of conflicting sets of descendants from 
above. Let q = p ^t-i — • Then the probability that there are at least q\N t \ 
conflicting sets of descendants is 
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Pr (Vi G M : is conflicting) < | J Pr (Dj is conflicting) 

MCNt MCN t i£M 

|M|>g|AT t | \M\>q\N t \ 

< Yl (2 fe+1 A:C) |M| < 2^*1 {2 k+1 K) qm 



MCNt 
|M|> g |JV t | 



I2(2 fe+1 A;C) 2fe " 1 ) < e-'^l < n 



for any c > 0. 

Let Xj C Di denote the set of vertices that are descendants of % and are 
contacted in round t + k. We have 

\N t \ \N t \ 

\N t+k \> \ X *\ = (1^1 I A is conflict-free) 

2=1 Z = l 

is conflict-free is conflict-free 

ijv*i \m 

= ^ |A is conflict-free) — ^ (|Xj| |Dj is conflict-free) . 

i=i «=i 

is conflicting 

Since the largest number of descendants of i that can be informed in round 
t + k is 2 k ~ 1 , this yields that 

|M| |JVt| 
\N t+k \ > ^2(\Xi\ | A is conflict-free) - 2 fc ~ 1 . 

i=l »=1 

is conflicting 

We can say the following about the expectation of for conflict-free sets of 
descendants. 

Claim 12. We /iave 

E(\Xi\\Diis conflict-free) =p(l+p) k ~ 1 . 

Proof. We prove this claim by induction on the number of rounds that have 
occurred. Assume that is conflict-free. At time t + 1, the probability 
that % has successfully contacted a new node is p, and therefore the expected 
number of nodes informed by i is p. Let j & {1, ... ,k — 1} be such that for 
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all r G {1, . . . ,j} the expected number of descendants of i informed in round 
t + r is p(l + p) r_1 . 

Then the expected number of descendants of i informed in round t+j + 1 

is 

□ 

Let X := Yl\=i is conflict-free). Note that the conditional ran- 

dom variables (X^Di is conflict-free) are independent. As 



Pr (X < p{l + p + p 2 )(l + #- 3 |iV t |) 
= Prfx<p(l + j9) fc - 1 |iV t | fl ; ' 



1+p) 



Pr I X < E(X) I 1 



(1+p) 

^pr ( ^<^ri P 



2 k ^ 2 k ~ 1 V 

using Chernoff bounds on the random variables i^-^A 
yields 



Dj is conflict-free 



Pr (X < p(l + p + p 2 )(l + p) k ~ 3 \N t \) 
< ex P -7^7^ — U = exp 1 



2 fc (l + pY J 1 \ 2 k 



< n 



for any c > 0. 

This means that we have 

X >p(l+p + p 2 )(l+p) k ~ 3 \N t \ 

with probability at least 1 — n~ c . 
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fc-1 



So with probability at least 1 — n 

\Nt\ 

\N t+k \>X- 2 

i=l 

£)j is conflicting 

> p(l + p + p 2 )(l + p) fc - 3 |AT t | - p 3 (l + p) k ~ 3 \N t \ 
= p(l + pf- 2 \N t \. 

This proves Theorem [TTJ □ 

We also have to ensure that after the performance of one busy phase, a 
big enough fraction of the informed vertices is newly informed, since only 
the newly informed vertices are active in the next phase. We show that this 
holds in the proof of the following corollary 

Corollary 13. Define t such that at point t we have \N t \ > (pe'lnn) 2 and 
\I t \ < min jc'^, p ( 1+ 2 p)I-2_x \Nt\ j ■ If we perform a busy phase of length k, 
then for any c > at the conclusion of this busy phase we have 

2 k - 1 

with probability at least 1 — n~ c . 

Proof. If we perform a busy phase of length k, by Theorem [TT1 we get 

k 



i=i 



i=i 



p(l + p) 

(2 fc -l)p(l+p) 



i=i 
iV t | 



p(l +p) k ' 2 - 1 

2 fc - 1 
^ p(l+p)*-2-l |iV ' +fc| 

with probability at least 1 — n~ c for any c > 0. □ 
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4.3.2 Assembling of the Busy Phases. 

Now that we have analyzed a single busy phase, we can put these phases 
together to obtain a constant fraction of informed nodes. Let e > 0, p £ (0, 1] 
and 

* := — ( log 1+P - + 2 

£ \ y p 

„fc-i 

As in the previous section, let ( < £(2e) p 3 (i+p) fc - 3 ' ( and (' := 2~%. We 
show the following. 

Theorem 14. Lei e' > 0. Let t 2 be such that in our model at point t% we 
have \N t2 \ > (pe'hin) 2 and |J t2 | < min jc'n, p(1 ^ ) 'fc-a_ 1 Wt 2 | } ■ 

Lei £ denote the smallest integer such that if we perform £ busy phases 
with k rounds we have |J* 2 +^| > C' n - 



T/ien 

£< 



(1 + e) log 1+p n 



and 

2 k - 1 

l J * 2 +^l - p(i +p )fc~2_ 1 l-^+ttl 
/ioW wii/i probability at least 1 — for any c > 0. 

Proof. Since p(l +p) fc ~ 2 > 1 and |A^ 2 | > (joe' Inn) 2 , we can use Theorem [TT] 
inductively and obtain that for all s £ {1, ...,£} we have 

|iV t2+sfc | >p(l+p) fc - 2 |A^ 2+(s _ 1)fe | > ... > (p(l+p) fc - 2 ) s |iV, 2 | 
with probability at least 1 — sn~ c for any c > 0, and therefore 

\It 2+ ik\ > {p(l+p) k - 2 )\ 

with probability at least 1 — £n~ c for any c > 0. 
Since n> (n> |/ t2+ ^|, we have 

log 1+p n > log 1+p Cn 

>log 1+p (p(l+p) fc - 2 )' 
= £(fc-2 + log 1+p p), 
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which implies that 

log 1+p n (1 +e)\og 1+p n 



£ < 

k -2 + log 1+p p k 

with probability at least 1 — rT c for any c > 0. 

Since |J t2 | < p n+ p \k-s_i \Nt 2 \, by an inductive application of Corollary 
we get that 

2 k - 1 

l J * 2+ ^l - p (l +p )k-2_ I \ N t2+tk\ 

holds with probability at least 1 — n~ c for any c > 0. □ 
4.4 Second To Last Phase 

Now that we have a small constant fraction of newly informed nodes, a lazy 
phase of a constant number of rounds suffices to yield a large fraction of 
newly informed nodes. 

Lemma 15. Let e E (0, 1) and k : = ^ ^log 1+p i + 2^. Lei t 3 6e swc/i that 

in our model at round t 3 we have \I ts \ < p n^I-2 _! |-^t 3 |; ° n< ^ t/iere einsi 

C,C £ (0, 1) suc/i tfia* C'" < 141 < /ioWs. Let S := 2fcl g /C) . 

yl/ier one lazy phase of S rounds starting at time t 3 , at least (1 — 3()n 
nodes will be newly informed with probability 1 — e~ n i n ) , 

Proof. We perform one lazy phase of 5* rounds starting at time £3. Let 
v eV\I t3 . Then 



Pr(no v E N t3 contacts Vq in this phase) = ( 1 

/ P S\N t3 \ \ ( Sp(p(l+p) k - 2 -l)\I t:i \ 

< exp < exp ' 



n - 1 J ~ 1 V 2 fc (n - 1) 

/ Sp\I t3 \ \ f Sp('\ t 

We now calculate the expected number of newly informed nodes after S 
rounds. Let £4 := t 3 + 5*. 
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E (|iV t4 |) — \V\It 3 \ ■ Pt(v EV\It 3 is informed in this phase) 
>(n-|/J)(l-C) 

> (n - Cn) (1 - C) 

> (1-2CK 

We will now use the Azuma-Inequality (see Lemma 0J. Number the nodes 
of iVtg from 1 to | N t:j \ . Then for all i G { 1 , . . . , | N t:j | } , define the random 
variable X{ as the set of vertices that i contacts in the S lazy rounds. Now 
we can define the function / such that 



f(X 1 , . . . , X\ Nt \] 



By this definition, we see that 



\Nt 3 1 



t?, 



8=1 



/(a?i, . . . . . . ,X|jv t ,|) - /(si, 



X\N t j) < S. 



Therefore, we can calculate the probability that we inform less than (1 
3()n vertices in this phase. 



Pr(|i\y < (l-3C)n) = Pr(|JV t4 |<(l-2C)n-Cn 

/ 2C 2 n 2 

<Pr(|iV, 4 -E(|iV 44 |)|>Cn) < 2exp -— p^— 
< 2exD ( 2 ^ n2 \ - e -n(n) 



□ 



4.5 The Final Phase 

The last phase of the protocol is again a lazy phase. We now use the large 
fraction of newly informed nodes from the previous phase to inform the few 
remaining nodes. 
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Lemma 16. Let e G (0,1) and r] < §. Let t 4 be such that in our model 

at round t 4 we have \N t4 \ > (1 —rj)n. After one lazy phase of ^^lnn 
rounds starting at time t±, all the nodes will be informed with probability 
l-Oin-^V 12 ). 

Proof. We will perform one lazy phase of In n rounds starting at time 
t 4 . Let Vq G V \ It 4 - Then 

, n ( p(3 + £)lnr^^ |iYt4 ' 

Prfno v G N t , contacts vq m this phase) = 1 — 

V M n ~ 1) 

f-(3 + e)\nn\N u \\ ^ (-{3 + e)(l - rf)n\nn 
< exp — — < exp 1 



3(n-l) / ~ *V 3(n-l) 

< exp( -( 3 + g )( 1 3 -^ ln ^ < n -d + e(l-,)/12)_ 

So the probability that all the nodes become informed is 

Pr(Vu G V \ Jt 4 : v becomes informed) 

= 1 — Pr(3n G V \ It A ■ v does not get informed) 

> 1 — Pr(f does not get informed) 

v£V\h 4 



□ 



4.6 Proof of Theorem [8] 

Let e G (0,1), p G (0,1] and k := (logi +p ^ + 2 J . Furthermore, let 

{ 2 fe-i ^ 
I(2e)~p 3 ( 1 +") fc - 3_fc " 1 , ^ > and C' := 2~ fc C 

We start a delayed quasirandom rumor spreading protocol with message 
success probability p and with one initially informed vertex. We first perform 
one lazy phase of length t\ := -^ehxn. By Lemma [9] this yields that \N tl \ > 
^pelnn holds with probability at least 1 — n~ p£ / m . Of course, after one lazy 
phase of length t\ we have with probability one \N tl \ < t\ and \It x \ < ti + 1. 
So we can apply Lemma ITUl and get |iV 42 | > (|pelnra) 2 , this phase succeeds 
with probability at least 1 — n -7 for any 7 G [0, 1). Furthermore we have 
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with probability one |J t2 | < Inn) 2 + |e Inn + 1 < ('n as well as |J, 



p(i + p)fc-2_ 1 |-^t 2 | f° r an y sufficiently large n. So we can apply 

Theorem PHI with e' := |. This gives us an £ < 1 ° gl+p - such that if we 
set ^3 := t2 + £k, then for any c > we have with probability at least 1 — n~ c 

2 k — 1 

C'n < |/ t3 | < (n and |/ t3 | < ^ - ^ k _ 2 _ - \N t3 \. 

So with probability at least 1 — n~ c the preconditions of Lemma [15] are 
fulfilled. Therefore, if we set S :— 2 and £4 := £3 + 5, we get |A^ 4 | > 

(1 — 3()n with probability at least 1 — n~ c . We can consequently apply 
Lemma [TBI with rj := 3£. We conclude that after In n more rounds all 
the nodes will be informed with probability 1 — 0(n~ £ ( l ~ £ ^ 12 ). 
Overall, we perform at most 

\e In n + (1 + e) log 1+p n + S + ^ hi n < (1 + e) ^± Inn + log 1+p n 

rounds in our delayed quasirandom rumor spreading protocol with message 
success probability p. 

The overall failure probability is at most 



4.7 Upper Bound for the Fully Random Protocol 

The proof of the upper bound provided in Theorem[8]can easily be modified to 
yield the corresponding bound for the classical randomized rumor spreading 
protocol. 

Theorem 17. For every e > and p e (0, 1], the number of rounds we need 
to inform all the nodes of the complete graph on n vertices using the fully 
random rumor spreading model with transmission failure rate 1 — p is 

(1+e) (log 1+p n + Mnn) 

with probability at least 1 — n~ p£//40 . 
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We omit a formal proof, since all the necessary arguments are used in 
the proof of Theorem [S] and can be reapplied to fit our needs. Much of our 
reasoning can be simplified significantly since we do not have to cope with 
the dependencies present in the quasirandom model. 

The only aspect in which the quasirandom model is superior to the ran- 
dom model is the first phase. Here we have the advantage that one vertex 
does not inform a neighbor more than once in time shorter than its degree. 
However, for the fully random model the chance that a vertex informs one 
or more neighbors multiple times within t rounds is only Q(t 2 /n). Hence the 
benefits of the quasirandom model are minimal. 

5 Conclusion 

In this paper, we present the first precise results pertaining to the robustness 
of randomized rumor spreading and its quasirandom variant up to constant 
factors. We showed that if the network topology is a complete graph on n 
vertices and each transmission only reaches its destination with probability 

p, then after (1+e) ^Mnn + log 1+p nj rounds, the quasirandom protocol will 

have informed all nodes in the graph with probability at least 1 — n _p£//40 . For 
p = 1, this result coincides with the known sharp bound of (l + o(l))(log 2 n + 
Inn). We also showed that the robustness of the quasirandom model is at 
least as good as that of the classical protocol. 

This work prompts the question: do we observe similar robustness behav- 
ior for other network topologies? We believe that for most natural network 
topologies, the quasirandom variant is as robust as the fully random one. 
Unfortunately, no other precise analysis up to leading constants of the run- 
time of non-trivial graph classes has been published, not even for the fully 
random model. Hence, to fully understand robustness, it is necessary to first 
conceive of a precise analysis of the classical model, e.g., on hypercubes or 
on random graphs. 
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